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Abstract 

We introduce a new paradigm for finite precision iterative decoding on low-density parity-check 

H 

codes over the Binary Symmetric channel. The messages take values from a finite alphabet, and unlike 

i traditional quantized decoders which are quantized versions of the Belief propagation (BP) decoder, 

y—i the proposed finite alphabet iterative decoders (FAIDs) do not propagate quantized probabilities or 

> 

log-likelihoods and the variable node update functions do not mimic the BP decoder. Rather, the 

o 

update functions are maps designed using the knowledge of potentially harmful subgraphs that could 
[» — be present in a given code, thereby rendering these decoders capable of outperforming the BP in the 

o 

£N| error floor region. On certain column-weight-three codes of practical interest, we show that there exist 

3-bit precision FAIDs that surpass the BP decoder in the error floor. Hence, FAIDs are able to achieve 
^ a superior performance at much lower complexity. We also provide a methodology for the selection 
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of FAIDs that is not code-specific, but gives a set of candidate FAIDs containing potentially good 
decoders in the error floor region for any column-weight-three code. We validate the code generality of 
our methodology by providing particularly good three-bit precision FAIDs for a variety of codes with 
different rates and lengths. 

I. Introduction 

At the heart of modern coding theory lies the fact that low-density parity-check (LDPC) codes 
[[TJ can be efficiently decoded by message-passing algorithms which are based on the belief 
propagation (BP) algorithm pj. The BP algorithm operates on a graphical model of a code 
known as the Tanner graph, and computes marginals of functions on the graph. While inference 
using BP is exact only on loop-free graphs (trees) and exact inference on loopy graphs is hard 
even under strong restrictions of graphical model topology, the BP still provides surprisingly 
close approximations to exact marginals on loopy graphs. 

However, the sub-optimality of BP on loopy graphs contributes to the error floor phenomenon 
of LDPC codes. The error floor is an abrupt degradation in the slope of the error-rate perfor- 
mance in the high signal-to-noise-ratio (SNR) regime, where certain harmful loopy structures, 
generically termed as trapping sets [|3j present in the Tanner graph of the code, cause the decoder 
to fail. Although there have been some important works related to devising algorithms that can 
provide better approximations on the marginals such as the generalized BP algorithms of [ffl, 
these are still too complex for practical use. Moreover, the effects of finite precision that are 
introduced when decoders are realized in hardware can further contribute to the error floor 
problem. A glimpse at the iterative decoders developed so far reveals a wide range of decoders 
of varying complexity. The simple binary message passing algorithms such as the Gallager A/B 
algorithms [[TJ occupy one end of the spectrum, while the BP lies at the other end. The gamut 
of decoders filling the intermediate space can simply be understood as the implementation of 
the BP (and variants) at different levels of precision. 
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In this paper, which serves as the first part of our two-part paper series, we introduce a novel 
approach to the design of finite precision iterative decoders for the Binary Symmetric channel 
(BSC) which we refer to as finite alphabet iterative decoders (FAIDs) to signify the fact that 
the messages belong to a finite alphabet. Some of the key features that clearly distinguish our 
approach from all the other existing works on finite precision iterative decoders (which shall 
be discussed in greater detail in the next section) are: 1) the messages are not quantized values 
of log-likelihoods or probabilities, and 2) the variable node update functions are simple well- 
defined maps rather than approximations of the update functions used in BP. The maps for 
variable node update in FAIDs are designed with the goal of increasing the guaranteed error- 
correction capability by using the knowledge of potentially harmful subgraphs that could be 
present in any given code, thereby improving the slope of the error floor on the BSC ||5). Since 
the variable nodes in the proposed decoders are now equipped to deal with potentially harmful 
neighborhoods, which is in contrast to BP which treats the loopy Tanner graph as a tree, the 
proposed decoders are capable of surpassing the floating-point BP in the error floor. 

We restrict our focus to FAIDs for column- weight-three codes. The main reason for this is 
that such codes, while enabling extremely simple hardware implementations, are notoriously 
prone to higher error floors especially at moderate to high code rates compared to codes of 
higher column- weights. Being able to design good yet simple decoders for these codes not only 
validates our novel approach, but also further ascertains the importance of addressing the error 
problem from the viewpoint of improving the iterative decoding rather than from the viewpoint 
of code construction [6], as it becomes very difficult to construct such codes to be devoid of 
certain harmful loopy graphs or with large girth without compromise on the code rate and length. 

We also provide a semi-heuristic -based selection method that is not code specific, but gives a 
set of candidate FAIDs that contains potentially good decoders in the error floor for any given 
column-weight-three code. The method relies on analyzing the behavior of different FAIDs on 
a small number of carefully selected subgraphs which are potential trapping sets with errors 
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introduced in the nodes of the subgraph. In order to carry out this analysis, we introduce the 
notion of noisy trapping set, which is a generalized notion that takes into account the possible 
effects of a neighborhood by initializing different possible sets of messages into the trapping set. 
Using this approach, we obtain a set of particularly good 3-bit precision FAIDs that are each 
capable of surpassing BP on several column- weight-three codes. 



II. Finite precision iterative decoding and the error floor: prior work 

There have been several important works related to the design of quantized BP decoders. 
Early works include the Gallager-E and other finite precision decoders proposed by Richardson 
and Urbanke [FT), and reduced-complexity BP decoders such as the normalized min-sum and 
offset min-sum decoders proposed by Chen et al. [8] and by Fossorier et al. [9|. More recent 



works include the quantized BP decoders proposed by Lee and Thorpe [ 10 1, and also by Kurkosi 



and Yagi [ 11 1. In all of the aforementioned works, the quantization schemes are designed based 
on optimizing for the best decoding threshold on a given code using the asymptotic technique 
of density evolution (DE) [7| with the primary goal of approaching the performance of the 
floating-point BP algorithm. Since asymptotic methods are inapplicable to finite length codes as 
they do not to take their particular structures into account, these decoders designed for the best 
DE thresholds do not guarantee a good performance on a finite length code especially in the 



high SNR region. This was also evidenced in [12|, [14| where the FAIDs that were chosen for 



a given code solely based on their DE thresholds were not the best performing FAIDs and in 
some cases even exhibited high error floors. Moreover, the effects of quantization on the error 
floor can vary depending on the particular structure of a given code [[3j, [15|. 



The error floor problem of LDPC codes has gained significant attention over the past several 
years with several works addressing the problem from a decoding perspective by proposing 
modifications to the BP decoding algorithm. Some of the notable works include augmented 



belief propagation [16|, informed dynamic scheduling [17|, multi-stage decoding [18|, averaged 
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decoding |T9J, and use of post-processing to lower the error floors p0| , Q2TJ. While all these 
schemes certainly provided performance enhancements in the error floor, all of them require either 
a considerable increase in decoding complexity due to the modifications and post-processing 
or are restricted to a particular code whose structure is well-known. In addition, they do not 
take finite precision into account and this can drastically affect the performance gains when 



implemented in hardware due to possible numerical precision issues [23 1. 

Therefore, addressing the error floor problem while also taking finite-precision into account in 
the decoder design is of prime importance especially with emerging applications in communica- 
tion and data storage systems now requiring very low error-rates and faster processing speeds. 
In this regard, there are some relevant works worth mentioning. Zhao et al. in (22j proposed 
several modifications to the offset min-sum decoder while taking quantization into account. Their 
proposed quantization schemes enabled their offset-min sum decoder to approach performance 
of floating-point BP algorithm on the Additive White Gaussian Noise channel (AWGNC) with 



six bits of quantization with possible improvement in the error floor. Zhang et al. in [23 [ also 
studied the effects of quantization on the error floors of BP over the AWGNC, and designed 
schemes that led to substantial error floor reductions when six bits of quantization were used. 

However, using our novel approach, which is different from the above mentioned works (due 
to the reasons mentioned in Section [I]), we are able to design simple 3-bit precision decoders 
that are capable of surpassing the floating-point BP. 

III. Preliminaries 

Let G denote the Tanner graph of an (N, K) binary LDPC code C of rate R = K/N, 
which consists of the set of variable nodes V = {t>i, • • • ,v^} and the set of check nodes 
C = {ci, • • • , cm}- The degree of a node in G is the number of its neighbors in G. A code C 
is said to have a regular column-weight d v if all variable nodes in V of have the same degree 
d v . The set of neighbors of a node v.- t is denoted as AT(vi), and the set of neighbors of node Cj 
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is denoted by jV(cj). The girth of G is the length of shortest cycle present in G. 

Let x = (xi, x 2 , ■ ■ ■ , xn) denote a codeword of C that is transmitted over the BSC, where %i 
denotes the value of the bit associated with variable node v„ and let the vector received from 
the BSC be r = {n, r 2 , . . . , r^}. Let e = (ei, e 2 , . . . , ejv) denote the error pattern introduced 
by the BSC such that r = x © e, and © is the modulo-two sum. The support of an error vector 
e = (ei, e 2 , • • • , ejv), denoted by supp(e), is defined as the set of all positions i such that ^ 0. 
Let y = (j/i, 2/2, . . . , t/jv) be the input to the decoder, where each is calculated based on the 
received value ?v We shall also refer to the values j/j as channel values. During the analysis of 
decoders, we shall assume that the all-zero codeword was transmitted. This is a valid assumption 
since the decoders we consider are symmetric |7). 

A trapping set (TS) denoted by T(y) (as originally defined in pj) is a non-empty set of 
variable nodes that are not eventually corrected for a given decoder input y. If T(y) is empty, 
then the decoding is successful. Note that T(y) will depend on the number of decoding iterations. 
A common notation used to denote a TS is (a, b), where a = |T(y)|, and b is the number of 
odd-degree check nodes in the subgraph induced by T(y). 

Let T(a,b) denote the topology associated with a (a, b) TS, which is a graph consisting of 
a variable nodes and b odd-degree check nodes. A TS is said to be elementary if T contains 
only degree-one or/and degree-two check nodes. Throughout this paper, we restrict our focus to 
elementary trapping sets, since they are known to be dominant in the error floor [[3j, [24 1. Also, 



whenever we refer to a TS, we will implicitly refer to its underlying topological structure T. 

IV. Finite alphabet iterative decoders 
We shall now introduce a new type of finite precision decoders which we refer to as FAIDs 



1 12), (13). An N s -level FAID denoted by D is defined as a 4-tuple given by D = (M, y, $ c ). 
The finite alphabet M. defined as Ai = {—L s , . . . , — L 2 , —Li, 0, Li, L 2 , . . . , L s }, where Li 6 M + 
and Li > Lj for any i > j, consists of N s = 2s + 1 levels for which the message values are 
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confined to. The sign of a message x G M can be interpreted as the estimate of the bit associated 
with the variable node for which x is being passed to or from (positive for zero and negative 
for one), and the magnitude \x\ as a measure of how reliable this value is. 

The set y denotes the set of all possible channel values. For FAIDs over the BSC, y is defined 
as y = {±C}, where C G M + , and the value y-i E y for node Vi is determined by yi = (— l) ri C, 
i.e., we use the mapping — > C and 1 — > — C. Note that for the BP and min-sum algorithms 
(where the messages are log-likelihoods), the decoder input y is a real-valued vector (y = R). 
Let mi, • • • ,m;_! denote the extrinsic incoming messages to a node with degree /. 

A. Definitions of the update functions $„ and $ c 

The function $ c : M. dc ~ x — > M used for update at a check node with degree d c is defined as 



Note that this is the same function used in the min-sum decoder, and hence the novelty in the 
proposed decoders lies in the definition of the variable node update function 

The function : y x M 4 "^ 1 — > M. is the update function used at a variable node with 
degree d v , and is defined in closed form as 



where the function Q(.) is defined below based on a threshold set 3? = {Tj : 1 < i < s + 1} 
with Ti G R + and T { > Tj ifi>j, and T s+1 = oo. 



The weight Ui is computed using a symmetric function Q : M dv ^ 1 — > {0, 1}. Based on this 
definition, the function & v can be classified as a linear-threshold (LT) function or a non-linear- 
threshold (NLT) function. If SI = 1 (or constant), i.e., if the value of Ui is always 1 (or constant) 




(1) 






otherwise 
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for all possible inputs of f2, then is an LT function and a FAID with such a <fr v is classified 
as an LT FAID. Else, $ v is an NLT function and a FAID with such a $ v is an NLT FAID. 

Note that for an LT FAID, takes a linear combination of its arguments and then applies 
the function Q to determine its output. Therefore, will always output the same value for any 
possible set of incoming messages for a given y, h if their sum remains the same. For example, for 
a node with d v = 3, C, m 1; m 2 ) = & v (— C, m 3 , m 4 ) when mi+m 2 = ""23 + 7724. This is also 
a typical property present in existing quantized decoders such as quantized BP and min-sum. 

On the other hand, for an NLT FAID, <3>„ takes a non-linear combination of its arguments (due 
to fi) before applying the function Q on the result. Therefore, can output different values 
even for distinct sets of incoming messages that have the same sum. For instance, consider a 
map for a node with d v — 3 such that $„(— C, — L 3 , L 3 ) = and $^(—0, — L 2 , £2) = — -ki- 
ln this case, the two distinct sets of incoming messages are {— L 3 ,L 3 } and {— L 2 ,L 2 }, and 
the sums are zero for both the sets. However, $^ still gives different outputs for each of the 
sets namely, and —L\ respectively. Hence these decoders are different from existing quantized 
message-passing decoders. Note that the function & v satisfies the following two properties. 

Definition 1 (Symmetry property). A FAID is a symmetric decoder if its update function & v 
satisfies $ v (y i: m u m dv ^) = -^ v (-y h -m u -m^-i). 

Definition 2 (Lexicographic ordering). A FAID is said to be lexicographically ordered if $ w 
satisfies $„(-C, mi, ... , m^-i) > $„(— C, m' 1; . . . , m' dv _ 1 ) Vm^ > m-, i e {1, . . . , d v - 1}. 

The property of lexicographic ordering ensures that the for a given channel value, the output 
is always non-decreasing with increase in the values of incoming messages. For instance, a map 
where <&„(— C, L x , L 2 ) = L 2 and C, L 2 , L 2 ) = Li is forbidden, since $^(—0, L 2 , L 2 ) > 

C, Li, L 2 ). This is a typical property also present in existing message-passing decoders. 
The decision rule used in FAIDs at the end of each iteration, to determine the bit value 
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corresponding to each node Vi, is simply the sign of the sum of all incoming messages plus the 
channel value (positive implies zero and negative implies one). 

It is evident from the definition that can be uniquely described either by assigning real 
values to the elements of At, 2? and y, and defining f2, or by providing a set of constraints 
which the assigned values can take. As examples, we provide the closed-form description of $„ 
for a 5-level NLT FAID and a 7-level LT FAID defined for column- weight- three codes. 

Example 1 (5-level NLT FAID). The constraints on the values assigned to elements of Ai and 
3? that describe this map are: C = L±, L 2 = 3Li, T\ = L\, T 2 = L 2 , and the function Q is 
given by a;, = fi(mi, m 2 ) = 1 — (sign(mi) © sign(m 2 )) • 5(|mi| + |m 2 | — 2L 2 ). 

Example 2 (7-level LT FAID). The constraints on the values assigned to elements of Ai and 
2? that describe this map are: L\ < C < 2L\, L 2 = 2L±, L 3 = 2L 2 + C, and T\ = L\, T 2 = L 2 , 
and T3 = L3 — C, where Q = 1 since it is an LT function. 

Note in Ex. [TJ sign(x) = 1 if x < 0, and sign(x) = otherwise. Also note that although the 
rule defined in Ex. [2] appears to be similar to a quantized min-sum decoder, the messages in this 
decoder are not quantized probabilities or log-likelihoods. 

B. Describing the maps of Q v as arrays 

Let us alternatively define M to be M = {.U ; . M>. ■ ■ ■ ,M Ns } where Mi = -L s , M 2 = 
—L B -i,' ■ ■ , M s = —L\, M s+ i = 0, M s+2 = L 2 ,- ■ ■ , M Ns = L s . Then, $ v can be defined using 
-dimensional arrays or look-up tables (LUTs) rather than as closed-form functions, which 
enables simple implementations and also may be more convenient for decoder selection. 

For column-weight-three codes, the map specifying $„ is a simple two-dimensional array 
defined by [k,j]i<i<N s ,i<j<N s , where kj G M, such that C, M i: Mf) = Zy for any Mi, Mj G 
Ai. The values for $„(C, Mi, Mj) can be deduced from the symmetry of Table [j] shows an 
example of a defined as an array for a 7-level FAID when jji = — C. 
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It is easy to see that if the diagonal entries in the array [k,j}i<i<N s ,i<j<N s are different, then $„ 
must be an NLT function (as discussed previously). However, § v can still be an NLT function 
even if all the diagonal entries in the array are the same, as highlighted in the following lemma. 

Lemma 1. If C, — L x , L 2 ) = -L x , $„(-C,0,Li) = 0, $„(— C, 0, -L 2 ) = -L 3 , and 
$ v (— C, — Li, — L\) = —L 2 , then can not be expressed as a linear-threshold function, and 
hence it is a non-linear-threshold function. 

Proof: See Appendix ■ 
Consequently, the map defined by Table [I] is an NLT function. Fig. [T] shows the error-rate 
performances of the 5-level NLT and 7-level LT FAIDs defined in the two examples, and the 7- 
level NLT FAID defined by the Table [I] along with the floating-point BP and min-sum decoders on 
the well-known (155,64) Tanner code. All decoders were run for a maximum of 100 iterations. 
From the plot, we see that all the FAIDs significantly outperform the floating-point BP and 
min-sum on the code. We will soon provide the methodology used to identify good FAIDs. 

Note that a particular choice of [k,j]i<i<N s ,i<j<N s gives rise to a particular $„, and the choice 
must ensure that the both properties of & v are satisfied. For the remainder of the paper, we shall 
restrict our discussion to only class-A FAIDs. A natural question that arises at this point is how 
many class-A FAIDs exist. This can be easily enumerated by establishing a connection between 
class-A FAIDs and symmetric plane partitions. 

C. Symmetric plane partition representation of & v 

A symmetric plane partition tt is an array of nonnegative integers (^i,j)i>ij>i such that TTjj > 
iti+ij, 7Tjj > 7Tjj + i Vi, j > 1, and iz^j = 7^. If itij = Vz > r, Vj > s, and TTij < t Vi,j, then 
the plane partition is said to be contained in a box with side lengths (r, s,t). The value tt^ is 
represented as a box of height ir i; j positioned at (i, j) coordinate on a horizontal plane. 

Due to the imposition of the lexicographic ordering and symmetry of $„, there exists a 
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bijection between the array [k.j]i<i<N s ,i<j<N s and a symmetric plane partition contained in a 
(N s x N a x N s — 1) box, where each 7Ty is determined based on Fig. [2] shows the visualization 
of a plane partition corresponding to & v of the 7-level FAID defined in Table |IJ Kuperberg in 
(25) gave an elegant formula for the enumeration of symmetric plane partitions contained in a 
box, and we can directly utilize this for the enumeration of class-A iVs-level FAIDs as well. 

Theorem 1 (Number of Class-A A^-level FAID). The total number Ka(N s ) of symmetric 
lexicographically ordered N s -level FAIDs is given by 

K . ._ H 2 $N a )H 1 {N a )H 2 {N a -\) 
A{ s) H 2 (2N S + 1)^(2^-1) 

where H k {n) = (n — k) \ (n — 2k)\ (n — 3k) \ ... z'^ the staggered hyperfactorial function. 

Proof: The proof of the theorem follows from the bijection between the map $> v of a class-A 
FAID and a symmetric boxed plane partition. ■ 
The total number of class-A FAIDs for N s = 5 and A^ s = 7 levels are 28,314 and 530,803,988 
respectively. Henceforth, we shall restrict our discussion to only class-A FAIDs. 

V. Selection of finite alphabet iterative decoders 

It is evident from the previous section that identifying particularly good FAIDs from the set 
of all possible class-A FAIDs is highly non-trivial since the number of such FAIDs is still large. 
We now describe a general approach that can be used to identify a subset of candidate A^ s -level 
FAIDs, one or several of which are potentially good for any column-weight-three code. Our main 
aim behind this approach is to restrict the choice of FAIDs to a possibly small subset containing 
good candidates. Given a particular code, it would then be feasible to identify the best performing 
FAID from this subset by using brute-force simulation or emulation or some other technique on 
the code. Moreover, since the performance of a FAID on a given code depends on its structure, 
the goal of identifying several candidate FAIDs is more realistic than identifying a single good 
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FAID, and allows for devising a selection method that is not code- specific. Another important 
objective of our approach is to ensure that any FAID belonging to this subset is capable of 
surpassing BP in the error floor not just on a single code but on several codes. 

The approach we use relies on the knowledge of potentially harmful subgraphs that could 
be trapping sets for traditional iterative decoders when present in a given code. The candidate 
FAIDs are chosen by analyzing their behavior on each of these subgraphs with errors introduced 
in them. We will first introduce some important notions that form the basis of our approach, and 
then subsequently present a methodology for FAID selection for column- weight-three codes. 

A. Critical number and isolation assumption 

The notion of critical number associated with a TS of type T(a, b) was originally introduced 



for Gallager-A/B algorithms on the BSC [26|. It is computed by analyzing the Gallager-A/B 
decoding on errors contained in the topology T that is present in a code, assuming that all nodes 
outside the topology are initially correct. It provides a measure of how harmful a TS is, and 



hence, this notion is not only useful for predicting the error floor performance [ 24 ] but also for 
determining the harmful subgraphs that should be avoided in the code designs. 

In order to be able to extend the notion of critical number for FAIDs, we introduce the notion 
of isolation assumption p8| which is used to analyze the decoder on a potential TS T(a,b). 
Under this assumption, the neighborhood of the TS is such that the messages flowing into the 
TS from its neighborhood are not in any way influenced by the messages flowing out of the TS. 
Therefore, the messages flowing into the TS can be computed while completely disregarding the 



neighborhood [28 Theorem 1]. We shall now precisely define this notion. 

Let ^l k (G) denote the computation tree corresponding to an iterative decoder on G enumerated 
for k iterations with node v j £ V as its root. A node w £ !J k {G) is a descendant of a node 
u £ 3?i k {G) if there exists a path starting from node w to root Vi that traverses through node u. 

Definition 3 (Isolation assumption). Let H be a subgraph of G induced by P C V with check 
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node set W C C. The computation tree 3^ k (G) with the root Vi E P is said to be isolated if for 
any node u PUW in 2f k {G), u does not have any descendant belonging to PUW. If £^ k (G) 
is isolated Vt> , G P, then H is said to satisfy the isolation assumption in G for k iterations. 

Remark: The above definition is a revised version of the one given in [ |28| . 

Note that the isolation assumption is weaker than Gallager's independence assumption as 
explained in [28|. The critical number can now be defined in the framework of FAIDs. 

Definition 4. The critical number of a FAID denoted by D on a subgraph H is the smallest 
number of errors introduced in H for which D fails on H under the isolation assumption. 

Remark: We set the critical number to oo if D corrects all possible error patterns on H. 

The critical number can now be used as possible parameter for decoder selection where a 
decoder is chosen to maximize the critical number on a given TS(s). In principle, one could 
consider a database of potential TSs that are generated either through analytical or empirical 
evaluations of traditional decoders such as BP and min-sum on several different codes, and then 
select a FAID based on its critical numbers on all these TSs. 

However, the isolation assumption of a TS typically does not hold in an actual code for 
more than few iterations and hence the critical number may not reflect the true error-correction 
capability of the FAID on a code containing the TS. This is especially true for TSs of small 
sizes. Therefore, unless a very large database of TSs is considered or unless TSs with large sizes 
are considered such that isolation assumption holds for many more iterations (as done in [29] 
where the considered TSs are the weight-20 codewords), the strategy will remain ineffective. This 
motivates the need for a new notion that considers to an extent the influence of the neighborhood. 

B. Noisy trapping sets and noisy critical numbers 

Let us consider a harmful topology T(a, b) that has been identified as a potential TS on a 
given code. We introduce the notion of initialization vector which allows us to partially capture 
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the influence of its arbitrary (unknown) neighborhood during the analysis of a FAID on the T. 

Definition 5. An initialization vector on a TS T(a,b) is defined as a vector @ = (6i,...,6b) 
where Qi G Ai, such that during the message passing of a FAID on T, the message passed by 
the i th degree-one check node in any iteration is The TS T(a, b) is said to be initialized by 
such a vector and is referred to as a noisy trapping set. 

A FAID can now be analyzed by introducing errors into the variable nodes of the TS T(a, b) 
and passing messages iteratively on the edges of T under a given initialization vector. Note that 
the initialization vector on a TS is carried out only through the degree-one check nodes, and 
also that the initialization vector is not iteration-dependent. 

As an example, Fig. [3] depicts how a FAID is analyzed for a three-error pattern on a T(6, 2) 
initialized by a vector © = (9i, 9 2 ). A • denotes a variable node initially wrong (vi, v 2 , and V4) 
and a O denotes a node initially correct (t> 3 , t> 5 , and v 6 ). A □ denotes a degree-two check node 
and a ■ denotes a degree-one check node. Initially all the messages passed by all nodes except 
the degree-one check nodes are set to zero. Then the messages are iteratively updated using the 
maps $ v and <3> c by treating the topology T as if it were the Tanner graph of a code but with 
the exception that a degree-one check node sends 9\ (or 9 2 ) to its neighbors in all iterations of 
the message passing. The message update on a single edge from a variable node is shown in the 
figure for each of the nodes v\, v 2 , V3 , and v$ (t>4 and vq are similar to v 2 and v 3 respectively). 
Note that the messages mi, m 2 , . . . , m 6 denote the extrinsic incoming messages to these nodes. 

Let Nj denote the maximum number of iterations allowed for message passing under a 
particular FAID on TS T(a, b). We examine whether an error pattern is corrected by the FAID 
within Nj iterations under a given initialization vector on the TS T(a,b). 

Our main intuition for defining such a notion is as follows. Let us consider a code whose graph 
G contains a subgraph H that is isomorphic to the topology T(a,b). Assume that a particular 
FAID is being used for decoding an error pattern where some (or all) of the variable nodes in 
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H are initially in error and the nodes outside H are initially correct. During each iteration of 
decoding, different possible messages belonging to M. will be passed into the nodes of H from 
outside of H depending on its neighborhood. The initialization vector can be considered as a 
possible snapshot of the messages entering H through its check nodes in some arbitrary iteration, 
and different initializations represent the different possible influences that the neighborhood of 
H can have. Therefore, analyzing the FAID under different initializations on a given T can 
provide a good indication of its error correction capability on a code whose graph contains H. 

Although the initialization vector should ideally be iteration-dependent and include all mes- 
sages passed to all check nodes of T(a, b) from outside of T(a, b), this would make analyzing 
a FAID on T(a, b) computationally intractable. Therefore we only include constant values that 
are passed to degree-one check nodes into the initialization vector. We now define the notion of 
noisy critical number which is an extension of the notion of critical number for FAIDs. 

Definition 6. The noisy critical number of a FAID D under an initialization vector © on a TS 
T(a, b) is the smallest number of errors introduced in T(a, b) for which D fails on T(a, b). 

By determining the noisy critical number under every possible initialization vector © e Ai b 
on the TS T(a, b), a vector of noisy critical numbers, referred to as noisy critical number vector 
(NCNV), can be obtained for a particular FAID. Let N® denote the number of all possible 
initialization vectors, i.e., N® = \M b \. The NCNV of a FAID denoted by D on a given TS 
T(a, b) is given by <Av(T(a : b), A r /) = (Ci, C2, • • • , (n ), where Q is the noisy critical number 
determined under a initialization vector @ ; e M b on TS T(a, b) with N I being the maximum 
number of decoding iterations. The NCNV can now be used as a parameter for decoder selection. 

C. Choice of trapping sets for decoder selection 

Since our approach for identifying good FAIDs relies on determining the NCNVs of FAIDs 
on different trapping sets, the first step in the decoder selection is to carefully select the harmful 
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topologies that should be considered for the analysis. The selected trapping sets should be 
topologies that are known to exist in practical high-rate codes with dense graphs and are regarded 
as relatively harmful for existing iterative decoders. Also the trapping sets used should have 
notable differences in their topological structures, so that the candidate FAIDs identified from 
the analysis are more likely to be good on several codes rather than just on a single code. 



We use the trapping set ontology (TSO) [27| to determine which harmful topologies to 



consider. The TSO is a systematic hierarchy of trapping sets that is based on their topological 
relations, and it is specified in the form of a parent-child relationship between the trapping sets. 
A trapping set 7i is said to be a parent of a trapping set T2 if 7i contains 7i. For the decoder 
selection, the trapping sets are chosen such that they do not have many common parents, and 
that most of the parents (graphs of smaller size) in the TSO are considered. For simplicity, we 
ensure that all the trapping sets selected have the same value of b, so that the NCNVs determined 
from different trapping sets all have the same dimension. 



D. Decoder domination 

Having selected the harmful topologies, the next step in the decoder selection is to determine 
and be able to compare the NCNVs of different FAIDs on all the selected TSs. We introduce 
the notion of decoder domination in order to compare the NCNVs of different FAIDs. 

Let the set of chosen TSs for the analysis of FAIDs be denoted by A = {71, T2, ■ ■ ■ , Tn a } with 
cardinality N\. Let T = {Di, . . . , D^} denote the set of class-A N s -level FAIDs considered 
for possible decoder selection with cardinality Njr. Let ^D fc (7j, Nj) denote the NCNV of a 
FAID D t G J determined on a TS Tj G A, and let ^A^ 1 ^ (Tj, Nj) denote the i th component of 
the NCNV, i.e., JV^faNi) = &. 

A FAID D fc is said to dominate a FAID D; for a given initialization vector © ; , if 

■^{T^Nj) > ^(T 3 ,Nj) VjG{l,2,...,iV A } (3) 
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In other words, D k dominates D/ under a given initialization vector © ; if the noisy critical 
number of D fc is not less than the noisy critical number of D/ on all the TSs in A. 

The number of initialization vectors under which D fc dominates is denoted by h(D k , D;) 
and is given by 



where 1 is the indicator function that outputs a one when the condition in its argument is true 
and zero otherwise. 

If h(D k , D/) > n(Di, D fe ), then D fe is said to dominate with domination strength n(D fe , D/) — 
n(D;,D fc ). For simplicity we shall use the symbol > to denote domination, i.e., (D fc > Dj) = 1 
implies that D fe dominates Dj. 

E. Methodology for selection: a general approach 

For a given value of iV s , a methodology for identifying good A^-level FAIDs can now be 
devised based on the notions of decoder domination and the NCNVs. We remind the reader that 
the main goal of our approach is to be able to identify a small subset of candidate A^ s -level 
FAIDs, where each candidate FAID is potentially good on several codes. Let this small subset 
of selected FAIDs be denoted by T c . Ideally, if a candidate FAID could be selected solely based 
on how it dominates all the other FAIDs in J 7 , then one could possibly obtain an ordering of 
the FAIDs in T in terms of their dominance and conclude as to which ones are more likely to 
be good on a given code containing one or more of the TSs in A. Unfortunately, we have found 
that such an ordering does not exist since there can be many FAIDs that dominate a particularly 
good FAID (known a priori to be good) and yet perform poorly on certain codes. 

Therefore, without going into the details, we shall describe a general approach for selection 
that utilizes pre-determined small sets of good FAIDs and bad FAIDs denoted by T g and T\, 
respectively. The set T g consists of A^ s -level FAIDs that are known a priori to have good error 
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floor performance on several codes of different rates and possibly containing different TSs. The 
set Tb consists of iV s -level FAIDs that were found to perform well on one particular code but 
perform poorly on other codes. We regard FAIDs in Tb to be bad since our goal is to identify 
FAIDs that are capable of surpassing BP on several codes. 

We then evaluate whether a particular FAID D fc E T dominates or is dominated by the FAIDs 
in the sets T g and By using the sets T g and to compare with, we are inherently trying 
to select FAIDs whose NCNVs have characteristics similar to the NCNVs of FAIDs in JF g but 
dissimilar to the NCNVs of the FAIDs in Tb- Therefore, we define a cost function, that 
is based on domination strengths, and whose value determines whether the FAID should 
be accepted for inclusion into T c - We have observed that it is crucial for a candidate FAID to 
dominate most (or all) FAIDs in T g and also not be dominated by most (or all) FAIDs in T\> 
for it to be considered potentially good. This is reflected in the cost function ^ defined below. 

# fi (D fc ) = Yl (^ Dfc > D <) - ™( D " Dfe )) + 2 (^ Dfc ' - ™( D i> Dfc )) 

V D;S.F a , (D fc >Di)=l V D^e-Fb, (D fc >D i )=l 

(n(Di,D fc )-n(D fc ,Di))- ^ (n(D,-, D fc ) - n(D fc , D, 

V DiS^a, (Di>D fc )=l V DjE-Fj,, (D j >D fc )=l 

(5) 

The value of the cost function ^ is compared to a threshold r. If ^(D fc ) > r, then the 
FAID D fc is selected as a candidate to be included in T c , else it is rejected. The cardinality of 
T c depends on r since a smaller r accepts more FAIDs and a larger r accepts less FAIDs. The 
choice of Nj also plays a role and should generally be chosen to be small (5 to 10 iterations). 

Note that the approach we have presented in this paper is slightly different from the one 



proposed in [14|. In [14|, the selection algorithm assumes it has no a priori knowledge on the 
sets T g and F h , and then tries to progressively build the sets before using them to identify good 
candidate FAIDs. By instead utilizing pre-determined sets of T g and Tb in our approach in this 
paper, we have found that the selection procedure is greatly improved and we were able to 
obtain much better sets of candidate FAIDs T c (in terms of their error floor performance). Note 
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however that the approach of fl4| is still applicable to the selection method presented in this 
paper as it could still be used as an initial step for determining the sets T g and T^. 

Using our methodology, we were able to derive a set of good candidate 7-level FAIDs (which 
are 3-bit precision decoders) for column-weight-three codes. On a variety of codes of different 
rates and lengths, particularly good 7-level FAIDs chosen from T c all outperformed the BP 
(floating-point) in the error floor. Moreover, the loss in the waterfall compared to BP was found 
to be very reasonable. The numerical results to support this statement are provided in the next 
section. Another interesting remark related to our selection procedure that we have found is that, 
although the DE threshold values were not at all used as parameters in the selection of FAIDs, 
the candidate FAIDs that we obtained in set T c were all found to have fairly good DE thresholds. 

VI. Numerical results 



Earlier in Section IV we demonstrated the capability of 5-level and 7-level FAIDs to outper- 
form BP in the error floor on the (155,64) Tanner code. We now provide additional numerical 
results on the BSC to further illustrate the efficacy of FAIDs on column-weight-three codes of 
higher practical interest and validate our approach for decoder selection. The three codes used 
for the simulations were chosen to cover a broad variety of LDPC codes in terms of rate, length, 
and structure. They are: 1) an R — 0.751 (2388, 1793) structured code based on latin squares, 
2) an R = 0.5 (504,252) code, and 3) an R = 0.833 (5184,4322) quasi-cyclic code. 

The (2388, 1793) structured code with girth-8 was designed using the method of Nguyen et. al 
(6j, which is based on latin squares and avoids certain harmful trapping sets in the code design. 
The R = 0.5 (504, 252) code with girth-8 was designed using the progressive edge-growth 



(PEG) method of [ 30 1 while ensuring that it contains no (5,3) TS (see [27 1 for the topology). 
The (5184,4322) quasi-cyclic code is a high-rate girth-8 code with a minimum distance of 12. 

Figures |4j [6j and [5] show the frame error-rate (FER) performance comparisons versus the cross- 
over probability a between the particularly good 7-level (3-bit precision) FAIDs we identified and 
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the BP (floating-point). Table defines the FAID used on the (2388, 1793) and the (5184,4322) 
codes, while Table [n] defines the FAID used on the (504, 252) PEG-based code. Note that both 
are 7-level NLT FAIDs and all decoders were run for a maximum of 100 iterations. 

In all three codes, the 7-level FAIDs begin to surpass the BP at an FER~ 10~ 5 . Also notice the 
difference in the better slopes of the error floor for the 7-level FAIDs which can be attributed 
to their enhanced guaranteed error correction capability. For instance, all FAIDs used on the 
(155,64) Tanner code in Fig. [T] guarantee a correction of 5 errors, whereas BP fails to correct 
several 5-error patterns. It must also be noted that the good 7-level FAIDs identified using our 
approach outperformed BP on several other tested codes as well. Therefore, the methodology is 
applicable to any column-weight-three code and provides to an extent "universally" good FAIDs, 
as they are all capable of surpassing BP on not just few but several codes. 

VII. Conclusions 

We introduced a new paradigm for finite precision iterative decoding of LDPC codes on the 
BSC. Referred to as FAIDs, the newly proposed decoders use node update maps that are much 
simpler than BP yet capable of surpassing the floating-point BP with only three bits of precision. 
We described the general framework of FAIDs with focus on column-weight-three codes and 
provided examples of good 3-bit precision FAIDs. We also provided a general methodology to 
identify a set of "universally" good FAIDs, one or several of which are potentially good for any 
given column-weight-three code. Our methodology is thus not code-specific but rather utilizes 
the knowledge of harmful topologies that could be present in a given code. The supporting 
numerical results show that it is possible to achieve a much superior error-rate performance in 
the error floor at a much lower complexity and memory usage than BP by using FAIDs. 

Appendix 

Proof of Lemma 1: Assume that there exists an LT representation for such a $„, which 
is defined by assigning real values to the elements of the alphabet Ai, the threshold set & , 
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and the set y. Since $„(— C, — Lx,L 2 ) = —L x , the inequality Li — L 2 + C > T\ must hold. 
Also since Q v (—C, 0, L\) = 0, we have |C — Li| < 7\. Combining the two inequalities, we get 
Li-L 2 + C > |C— Li|. Now, if C > Li, then 2L X > L 2 , and if L x > C, then Li-L 2 + C > L\- 
C =4> 2C > L 2 2Li > L 2 . But since $„(-C, 0, -L 2 ) = -L 3 and $„(-C, -Li, -L x ) = -L 2 , 
we have L 2 + C > 2L\ + C =>• L 2 > 2Li, which is a contradiction. ■ 
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TABLE I 
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TABLE II 

LUT FOR <&„ OF A 7-LEVEL FAID WITH y { = -C 
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Cross-over probability (a) 



Fig. 1. Performance comparisons between the floating-point decoders: BP and min-sum , and the 3-bit precision decoders: 
5-level NLT, 7-level LT, and 7-level NLT FAIDs on the (155, 64) Tanner code. 




Fig. 2. A visualization of the plane partition as stacked boxes for the 7-level FAID whose <J>„ is described in Table [I] 
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Fig. 3. An example of a noisy T(6, 2) initialized by a vector ©. 
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Cross-over probability (a) 



Fig. 4. Performance comparisons between the BP (floating-point), the the 7-level FAID defined by Table [I] on the (2388, 1793) 
structured code. 
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Fig. 5. Performance comparisons between the BP (floating-point) and the 7-level FAID defined by Table |lT| on the (502, 252) 
quasi-cyclic code. 




Cross-over probability (a) 



Fig. 6. Performance comparisons between the BP (floating-point) and the 7-level FAID defined by Table[T]on the (5184, 4322) 
quasi-cyclic code. 
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